Lip Motion Automatic Detection
Authors
Abstract
An algorithm for detecting a speaker's lip motion is presented, based on the processing of a colour video sequence of the speaker's face under natural lighting conditions and without any particular make-up. It is intended for applications in speech recognition, videoconferencing, or the synthesis and animation of a speaker's face. The algorithm takes a statistical approach using Markov Random Field (MRF) modelling, with a spatiotemporal neighbourhood of the pixels in the image sequence. Two kinds of observations are used: the temporal difference between successive images (motion information) and the purity of red hue in the current and past images (spatial information about lip location). The field of hidden labels, relevant for lip motion detection, is obtained by energy minimisation and proves to be robust to lighting conditions (shadows). This label field is used to extract qualitative information (mouth opening and closing) and also quantitative information, by measuring some geometrical features (horizontal and vertical lip spacing) directly on the label field.
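As a rough illustration only, the pipeline outlined in the abstract (two observation maps, a label field obtained by energy minimisation, and geometric measurements taken on that field) might be sketched as follows. The abstract does not give the energy function, the label set, or the optimiser, so everything specific here is an assumption: the binary labels, the red-dominance proxy for "purity of red hue" (computed on the current frame only), the thresholds and weights, the Potts-style smoothness term, the use of iterated conditional modes (ICM), and all function names.

```python
import numpy as np

def observations(prev_rgb, curr_rgb):
    """Motion and redness maps for two float RGB frames with values in [0, 1]."""
    # Motion observation: absolute temporal difference of mean intensity.
    motion = np.abs(curr_rgb.mean(axis=2) - prev_rgb.mean(axis=2))
    r, g, b = curr_rgb[..., 0], curr_rgb[..., 1], curr_rgb[..., 2]
    # Assumed proxy for red purity: how far red dominates green and blue.
    redness = np.clip(r - np.maximum(g, b), 0.0, 1.0)
    return motion, redness

def unary_cost(motion, redness, thr_red=0.3, thr_mot=0.03, w_mot=1.0):
    """Cost of labelling a pixel 1 (label 0 costs nothing). Thresholds are assumed."""
    return (thr_red - redness) + w_mot * (thr_mot - motion)

def icm(unary1, prev_labels=None, beta=0.2, beta_t=0.2, n_iter=10):
    """Minimise a Potts-style energy with ICM on a 4-neighbour grid.

    If prev_labels is given, the same pixel in the previous label field
    acts as one extra (temporal) neighbour.
    """
    labels = (unary1 < 0).astype(np.int8)  # initialise from the data term alone
    h, w = labels.shape
    yy, xx = np.mgrid[0:h, 0:w]
    for _ in range(n_iter):
        # Checkerboard sweeps: pixels of one parity share no 4-neighbours,
        # so they can be updated together without interfering.
        for parity in (0, 1):
            p = np.pad(labels, 1, mode="edge")  # border pixels see their own label
            ones = (p[:-2, 1:-1] + p[2:, 1:-1]
                    + p[1:-1, :-2] + p[1:-1, 2:]).astype(np.float64)
            e1 = unary1 + beta * (4.0 - ones)  # penalty per neighbour labelled 0
            e0 = beta * ones                   # penalty per neighbour labelled 1
            if prev_labels is not None:
                e1 = e1 + beta_t * (prev_labels == 0)
                e0 = e0 + beta_t * (prev_labels == 1)
            mask = (yy + xx) % 2 == parity
            labels[mask] = (e1 < e0)[mask]
    return labels

def lip_extent(labels):
    """Horizontal and vertical extent (in pixels) of the labelled region."""
    ys, xs = np.nonzero(labels)
    if ys.size == 0:
        return 0, 0
    return int(xs.max() - xs.min() + 1), int(ys.max() - ys.min() + 1)

if __name__ == "__main__":
    # Synthetic example: a red patch appears on a grey background.
    prev = np.full((20, 30, 3), 0.5)
    curr = prev.copy()
    curr[8:12, 10:20] = (0.9, 0.2, 0.2)
    motion, redness = observations(prev, curr)
    field = icm(unary_cost(motion, redness))
    print(lip_extent(field))
```

Tracking the two values returned by `lip_extent` over successive frames would give a crude opening and closing signal. A real system would need thresholds calibrated on actual footage and, presumably, the richer spatiotemporal model described in the paper itself.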
Similar resources
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, researchers have taken up the design and production of speech recognition systems. Among these, lip-reading techniques face many challenges for speech recognition; one of the challenges b...
Automatic detection of liver tumor motion by fluoroscopy images
Background: A method to track liver tumor motion signals from fluoroscopic images without any implanted gold fiducial markers was proposed in this study to overcome the adverse effects on precise tumor irradiation caused by respiratory movement. Materials and Methods: The method was based on the following idea: (i) Before treatment, a series of fluoroscopic images corresponding to different bre...
Chapter proposal for Visual Speech Recognition: Lip Segmentation and Mapping
The aim of this chapter is to examine the possibility of extracting prosodic information from lip features. We used two measurement techniques enabling automatic lip feature extraction to evaluate the “lip pattern” of prosodic focus in French. Two corpora with Subject-Verb-Object (SVO) sentences were designed. Four focus conditions (S, V, O or neutral) were elicited in a natural dialogue situat...
Recognizing prosody from the lips
The aim of this chapter is to examine the possibility of extracting prosodic information from lip features. We used two measurement techniques enabling automatic lip feature extraction to evaluate the “lip pattern” of prosodic focus in French. Two corpora with Subject-Verb-Object (SVO) sentences were designed. Four focus conditions (S, V, O or neutral) were elicited in a natural dialogue situat...
Motion adaptive model-assisted compatible coding with spatio-temporal scalability
We introduce the concept of Motion Adaptive Spatio-Temporal Model-Assisted Compatible (MA-STMAC) coding, a technique to selectively encode areas of different importance to the human eye in terms of space and time in moving images with the consideration of object motion. Previous STMAC was proposed based on the fact that human "eye contact" and "lip synchronization" are very important in person-t...
Publication date: 1997